Modelling and Mitigation of Soft-Errors in CMOS Processors
نویسنده
چکیده
The topic of this thesis is about soft-errors in digital systems. Different aspects of soft-errors have been addressed here, including an accurate simulation model to emulate soft-errors in a gate-level net list, a simulation framework to study the impact of soft-errors in a VHDL design and an efficient architecture to minimize the impact of soft-errors in a DSP processor. The first two chapters of this thesis introduce the basic knowledge with regard to soft-errors. Chapter three introduces a simulation framework to study the impact of soft-errors in complex digital systems modelled in VHDL language. This framework has been introduced to resolve the enormous CPU time typically required in simulation-based soft-error experiments. Chapter four introduces two realistic simulation models that can emulate the impact of soft-errors in a 45-nm CMOS technology node at a gate level. One of the determination approaches has been extracted from radiation testing along with using a transistor-level soft-error analysis tool. Another approach has been developed by analysing the behaviour of soft-errors in a 45-nm CMOS technology node. In chapter 5, some unique features of DSP processors have been exploited to introduce a low-overhead soft-error mitigation architecture to minimize the impact of soft-errors in a DSP processor. This mitigation technique concerns unstructured parts of a processor (such as the control unit and data path). The unique features of DSP processors are existence of several functional units, a limited number of different opcodes in each functional unit and also highly-repetitive instruction flow in a DSP workload. Moreover, the mitigation method which has been developed for a single core has been applied to a multi-core environment in chapter 6 to propose a soft-error mitigation technique for multi-core architectures. Overall, based on simulated data and experiments, this thesis proposes a methodology to investigate the impact of soft-errors during the design phase of a digital system.
منابع مشابه
Reducing the Cost of Protection against Soft Errors using Profile-Based Analysis
The negative impact of the aggressive scaling of technology nodes on the sensitivity of CMOS devices to soft errors has been well studied in the past. Technology scaling makes processors more susceptible to transient faults. Errors caused by high-energy particle strikes in processors can result in unexpected behavior and incorrect results. With the smaller and cheaper CMOS devices pervasive in ...
متن کاملCost Effective Soft Error Mitigation in Microprocessors
Device scaling has caused the challenges that processor designers face to evolve significantly in the past. This trend will continue into the future, and reliability is emerging as a significant challenge. In this work, we focus on one aspect of the reliability problem: soft errors. In particular, cost effective mitigation of soft errors in processor microarchitecture. Our investigation begins ...
متن کاملA Multi-Level Approach to NBTI Mitigation in Processors
We are in the era of multicore processors and it is expected that the number of the processing cores on a chip will steadily increase over the next decade, driven by Moore’s Law. While technology scaling has benefitted high performance, the scaling has a dark side too: a degradation in the reliability of silicon devices. Processors have become highly susceptible to a variety of reliability prob...
متن کاملDesigning and Evaluating Redundancy-based Soft Error Masking on a Continuum of Energy versus Robustness
Near-threshold computing is an effective strategy to reduce the power dissipation of deeply-scaled CMOS logic circuits. However, near-threshold strategies exacerbate the impact of delay variations on device performance and increase the susceptibility to soft errors due to narrow voltage margins. The objective of this work is to develop and assess design approaches that leverage tradeoffs betwee...
متن کامل’ Introduction RELIABILITY - AWARE MICROARCHITECTURE
It is becoming increasingly difficult to achieve expected levels of reliability and data correctness as the industry approaches the era of extreme CMOS scaling. Aging-related device degradation is becoming a real threat to lifetime reliability. Many processors already include solutions for soft errors in memory structures. More recently, soft errors in logic paths have become an increasing conc...
متن کامل